147 results found.
Speech/Written
Lexicon,
Language Type:
Monolingual
Languages:
Czech
Availability:
Freely Available
License:
CreativeCommons
Size:
480 MByte Production Status:
Newly created-in progress
Use:
-
Paper title:Prague Dependency Treebank - Consolidated 1.0
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Marie Mikulová | MorfFlex CZ | /N |
Documentation:
https://ufal.mff.cuni.cz/morfflex
Written
Lexicon,
Language Type:
Monolingual
Languages:
Czech
Availability:
Freely Available
License:
Creative Commons
Size:
205 entries Production Status:
Existing-updated
Use:
Discourse
-
Paper title:CzeDLex 0.6 and its Representation in the PML-TQ
-
Paper track:Written/poster presentation with demo
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jiří Mírovský | CzeDLex | /N |
Documentation:
https://ufal.mff.cuni.cz/czedparse/czedlex
Written
Corpus,
Language Type:
Monolingual
Languages:
Czech
Availability:
Freely Available
License:
CC BY
Size:
4262 sentences Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:COSTRA 1.0: A Dataset of Complex Sentence Transformations
-
Paper track:Evaluation/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Petra Barancikova | Costra 1.0 | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
Czech English French German
Availability:
Freely Available
License:
CreativeCommons
Size:
31014 sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Supervised Visual Attention for Multimodal Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tetsuro Nishihara | Multi30k | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English French German Spanish Swedish
Availability:
Freely Available
License:
CreativeCommons
Size:
7 GByte Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Document Translation vs. Query Translation for Cross-Lingual Information Retrieval in the Medical Domain
-
Paper track:Long/Information Retrieval and Text Mining
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shadi Saleh | Extended CLEF eHealth 2013-2015 IR Test Collection | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English French German Hungarian Polish Spanish Swedish
Availability:
Freely Available
License:
CreativeCommons
Size:
2 MByte Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Document Translation vs. Query Translation for Cross-Lingual Information Retrieval in the Medical Domain
-
Paper track:Long/Information Retrieval and Text Mining
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shadi Saleh | Khresmoi Summary Translation Test Data 2.0 | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Bulgarian Croatian Czech Danish Dutch English Estonian Finnish French German Greek Hungarian Icelandic Irish Italian Latvian Lithuanian Maltese Polish Portuguese Romanian Slovak Slovenian Spanish Swedish
Availability:
Freely Available
License:
CC-0
Size:
341856530 sentences Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:ParaCrawl: Web-Scale Acquisition of Parallel Corpora
-
Paper track:Long/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Philipp Koehn | ParaCrawl | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Multilingual
Languages:
Czech
Availability:
Freely Available
License:
Creative Commons 3.0-BY-NC-SA
Size:
2730 lexemes Production Status:
Existing-updated
Use:
Lexicon extension
-
Paper title:To Pay or to Get Paid: Enriching a Valency Lexicon with Diatheses
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Anna Vernerová | Institute of Formal and Applied Linguistics, Charles University in Prague | CZ | Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University | CZ |
| Author 2 | Václava Kettnerová | Charles University in Prague | None | Institute of Formal and Applied Linguistics, Charles University in Prague | CZ |
| Author 3 | Marketa Lopatkova | Charles University in Prague | CZ | ||
| Main Contact | Anna Vernerová | Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University | None |
Documentation:
http://ufal.mff.cuni.cz/vallex/2.6/doc/structure_en.html
Speech/Written
Lexicon,
Language Type:
Multilingual
Languages:
Czech English Finnish French German Russian
Availability:
Freely Available
License:
Size:
206, 395 sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Understanding Pure Character-Based Neural Machine Translation: The Case of Translating Finnish into English
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gongbo Tang | MuCow | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Afrikaans Albanian Amharic Arabic Aragonese Armenian Assamese Azerbaijani Basque Belarusian Bengali Bosnian Breton Bulgarian Burmese Catalan Central Khmer Chinese Croatian Czech Danish Dutch Dzongkha English Esperanto Estonian Finnish French Gaelic Galician Georgian German Greek Gujarati Hausa Hebrew Hindi Hungarian Icelandic Igbo Indonesian Irish Italian Japanese Kannada Kazakh Kinyarwanda Korean Kurdish Kyrgyz Latvian Limburgan Lithuanian Macedonian Malagasy Malay Malayalam Maltese Marathi Mongolian Nepali Northern Sami Norwegian Norwegian Bokmål Norwegian Nynorsk Occitan Oriya Panjabi Pashto Persian Polish Portuguese Romanian Russian Serbian Serbo-Croatian Sinhala Slovak Slovenian Spanish Swedish Tajik Tamil Tatar Telugu Thai Turkish Turkmen Uighur Ukrainian Urdu Uzbek Vietnamese Walloon Welsh Western Frisian Xhosa Yiddish Yoruba Zulu
Availability:
Freely Available
License:
Size:
55 million sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Biao Zhang | the open parallel corpus (OPUS) | /N |
Documentation:
None




